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GenCore version 6.2.1 
Copyright (c) 1993 - 2008 Biocceleration Ltd. 



OM protein - protein search, using sw model 



June 24, 2008, 08:32:16 



Search time 263 Seconds 
(without alignments) 
2135.186 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-552-515-1 

4950 

1 MRMAATAWAGLQGPPLPTLC SELSSHWTPFTVPKASQLQQ 933 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 3405708 seqs, 601879884 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : A_Geneseq_2 0 0 711 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000 : * 

4: geneseqp2001 : * 

5: geneseqp2002 : * 

6: geneseqp2003a : * 

7: geneseqp2003b : * 

8: geneseqp2004a : * 
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9 : 

10: 

11: 

12: 



geneseqp2 004b : 
geneseqp2005 : 
geneseqp2006 : 
geneseqp2007 : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result Query 

No. Score Match Length DB ID Description 



1 


4950 


100 


0 


933 


8 


ADT77664 


Adt77664 Splice va 


2 


4950 


100 


0 


933 


11 


AEL84788 


Ael84788 Tumor mar 


3 


4531.5 


91 


5 


885 


10 


AEB13426 


Aebl3426 Human pro 


4 


4364 . 5 


88 


2 


843 


10 


AEB13424 


Aebl3424 Human pro 


5 


3736 


75 


5 


898 


4 


ABG15488 


Abgl5488 Novel hum 


6 


1531 . 5 


30 


9 


920 


6 


ADB64420 


Adb6 4 42 0 Human pro 


7 


1511 . 5 


30 


5 


920 


6 


ABP586 66 


Abp5 866 6 Human dih 


8 


1504 


30 


4 


981 


8 


ADK52114 


Adk52114 Human ato 


9 


1504 


30 


4 


981 


12 


AEN06206 


Aen06206 Human eso 


10 


1489 


30 


1 


1017 


12 


AFB77190 


Afb77190 Mouse TM- 


11 


1488 


30 


1 


960 


11 


AEG11142 


Aeglll42 Human tra 


12 


1479 . 5 


29 


9 


840 


11 


AEG11146 


Aeglll46 Human tra 


13 


1464 


29 


6 


1003 


7 


ADG48280 


Adg4828 0 Human ret 


14 


1455 


29 


4 


913 


11 


AEH82071 


Aeh82071 Human gna 


15 


1445 


29 


2 


1219 


4 


ABB62812 


Abb62812 Drosophil 


16 


1445 


29 


2 


1219 


10 


AFB95185 


Afb95185 Fruit fly 


17 


1402 . 5 


28 


3 


910 


6 


ADC42854 


Adc42854 REMAP pro 


18 


1402 . 5 


28 


3 


910 


11 


AEL84658 


Ael84658 Tumor mar 


19 


1369 . 5 


27 


7 


1075 


4 


ABB65993 


Abb65993 Drosophil 


20 


1369 . 5 


27 


7 


1075 


10 


AFC04729 


Afc04729 Fruit fly 


21 


1367.5 


27 


6 


712 


11 


AEG11145 


Aeglll45 Human tra 


22 


1199.5 


24 


2 


1058 


4 


ABB65022 


Abb65022 Drosophil 


23 


1199 . 5 


24 


2 


1058 


10 


AFC01816 


Afc01816 Fruit fly 


24 


1154 


23 


3 


596 


6 


ADB64387 


Adb6 438 7 Human pro 


25 


1061.5 


21 


4 


594 


4 


AAB92637 


Aab9263 7 Human pro 


26 


1061.5 


21 


4 


594 


5 


ABP43811 


Abp43811 FLJ10261 


27 


1061.5 


21 


4 


594 


8 


ADJ75429 


Ad j 75429 Marker ge 


28 


1061.5 


21 


4 


594 


8 


ADN04848 


Adn04848 Antipsori 


29 


1061 . 5 


21 


4 


594 


11 


AEG11143 


Aeglll43 Human FLJ 


30 


1037 . 5 


21 


0 


782 


6 


ADX42387 


Adx42387 Human col 


31 


1037.5 


21 


0 


782 


7 


ADT95905 


Adt95905 Colon can 


32 


1037.5 


21 


0 


782 


8 


ADQ96288 


Adq96288 T cell ac 


33 


1037.5 


21 


0 


782 


8 


ADQ96104 


Adq96104 T cell ac 


34 


912.5 


18 


4 


475 


6 


ADB64962 


Adb6 4962 Human pro 


35 


905 


18 


3 


642 


7 


ADM05798 


Adm05 798 Human pro 
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36 


905 


18 , 


.3 


642 


10 


AEC88728 


37 


905 


18 , 


.3 


642 


11 


AEG11144 


38 


819 . 5 


16 , 


. 6 


443 


5 


ABP41785 


39 


817 . 5 


16 , 


. 5 


179 


6 


AA029613 


40 


784 . 5 


15 , 


. 8 


390 


5 


ABB90382 


41 


735 


14 , 


. 8 


139 


5 


AAE24066 


42 


722 . 5 


14 , 


.6 


360 


4 


AAM40391 


43 


711 . 5 


14 , 


. 4 


346 


8 


ADP29628 


44 


695 . 5 


14 , 


■ 1 


608 


8 


ADQ96298 


45 


695.5 


14, 


. 1 


608 


8 


ADQ96286 



Aec88728 
Aeglll44 
Abp41785 
Aao29613 
Abb90382 
Aae24066 
Aam40391 
Adp29628 
Adq96298 
Adq96286 



Human cDN 
Human FLJ 
Human ova 
Human Nov 
Human pol 
Human pro 
Human pol 
Human sec 
T cell ac 
T cell ac 



ALIGNMENTS 



RESULT 1 
ADT77664 

ID ADT77664 standard; 
XX 



protein; 933 AA. 



DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



FT 
FT 



15-JUN- 
13-JAN- 



2007 
2005 



(revised) 
(first entry) 



Splice variant-novel gene expressed in prostate (SV-NGEP) polypeptide. 

Splice variant-novel gene expressed in prostate; SV-NGEP; human; 
prostate cancer; cytostatic; gene therapy; immunotherapy; BOND_PC; 
NGEP long variant; NGEP long variant [Homo sapiens]; G05886. 

Homo sapiens . 



Key 

Domain 
Region 

Region 
Region 
Region 
Domain 



Location/Qualifiers 
1. .345 

/label= Cytoplasmic 
157. .933 

/note= "An immunogenic fragment comprising 8 consecutive 
amino acids that specifically binds to an antibody that 
specifixally binds to a polypeptide comprising amino 
acids 157-933 is referred to in Claim 1" 



170. , 

/note= 
215. , 

/note= 



178 

"Epitope, 
223 

"Epitope, 



258. .266 
/note= "Epitope 
346. .368 

/label= Transmembrane 
369. .421 



predicted to bind HLA2-01" 
predicted to bind HLA2-01" 
predicted to bind HLA2-01" 



http://es/ScoreAccessWeb/GetItem.action?AppId=105525...24_083145_us-10-552-515-l.rag&ItemType=4&startByte=0 (3 of 44)10/10/^^^ 



SCORE Search Results Details for Application 10552515 and Search Resuh 20080624_083145_us-10-552-515-l.rag. 



FT /label= External 

FT /note= "Cell surface" 

FT Region 403. .411 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Domain 422. .441 

FT /label= Transmembrane 

FT Region 427. .435 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Domain 442. .501 

FT /label= Cytoplasmic 

FT Domain 502. .524 

FT /label= Transmembrane 

FT Domain 525. .543 

FT /label= External 

FT /note= "Cell surface" 

FT Domain 544. .566 

FT /label= Transmembrane 

FT Region 557. .565 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Region 562. .570 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Domain 567. .586 

FT /label= Cytoplasmic 

FT Domain 587. .609 

FT /label= Transmembrane 

FT Domain 610. .714 

FT /label= External 

FT /note= "Cell surface" 

FT Domain 715. .737 

FT /label= Transmembrane 

FT Domain 738. .761 

FT /label= Cytoplasmic 

FT Domain 762. .784 

FT /label= Transmembrane 

FT Domain 785. .933 

FT /label= External 

FT /note= "Cell surface" 

FT Region 846. .854 

FT /note= "Epitope, predicted to bind HLA2-01" 

XX 



PN WO2004092213-A1 . 
XX 

PD 28-OCT-2004. 
XX 

PF 05-APR-2004; 2 0 0 4WO-US0 1 05 8 8 . 
XX 

PR 08-APR-2003; 2003US-0461399P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
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XX 

PI Pastan I, Bera TK, Lee B; 

XX 

DR WPI; 2004-758338/74. 

DR N-PSDB; ADT77665. 

DR PC:NCBI; gi48093524. 
XX 

PT New Splice Variant-Novel Gene Expressed in Prostate polypeptide or 

PT encoding nucleic acid molecule for diagnosing, preventing or treating 

PT cancer, especially prostate cancer. 

XX 

PS Claim 1; SEQ ID NO 1; 88pp; English. 

XX 

CC The present sequence is the protein sequence of splice variant-novel gene 

CC expressed in prostate (SV-NGEP) . SV-NGEP is identical to NGEP from amino 

CC acid 1-157, diverging from amino acid 158. Expression analysis in 76 

CC normal and foetal tissues showed SV-NGEP to be strongly expressed only in 

CC a prostate sample. Claimed methods for detecting prostate cancer in a 

CC subject comprise: contacting the sample with an antibody that 

CC specifically binds a SV-NGEP polypeptide and detecting the formation of 

CC an immune complex; or detecting an increase in expression of SV-NGEP 

CC polypeptide or mRNA. Antibodies to an SV-NGEP polypeptide can be used to 

CC detect metastatic prostate cancer cells at locations other than the 

CC prostate. A claimed method for producing an immune response against a 

CC cell expressing SV-NGEP, for example in a subject with prostate cancer, 

CC comprises administering the polypeptide, or a polynucleotide encoding it, 

CC to produce an immune response that decreases growth of the prostate 

CC cancer. A claimed method for inhibiting the growth of a malignant cell 

CC that expresses SV-NGEP comprises culturing cytotoxic T lymphocytes (CTLs) 

CC with SV-NGEP to produce activated CTLs that recognise an NGEP expressing 

CC cell, and contacting the malignant cell with the activated CTLs. 

CC Alternatively, growth of a malignant cell is inhibited by contact with an 

CC antibody that specifically binds an SV-NGEP polypeptide, where the 

CC antibody is linked to an effector molecule (chemotherapeutic agent or 

CC toxin) that inhibits growth of the malignant cell. This may be performed 

CC in vivo. Kits for detecting an SV-NGEP polypeptide or polynucleotide in a 

CC sample are also claimed. 

CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 933 AA; 



Query Match 100.0%; Score 4950; DB 8; Length 933; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 933; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCARSRMLRRRA 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCARSRMLRRRA 6 0 



Qy 61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDL 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDL 12 0 

Qy 121 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 180 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 121 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 180 

Qy 181 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRF 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLPRF 2 40 

Qy 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300 

Qy 301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 36 0 

I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 36 0 

Qy 361 AVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 AWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 42 0 

Qy 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 480 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 48 0 

Qy 481 TGEDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAW 540 

I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 TGEDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAW 540 

Qy 541 ASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 ASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 6 00 

Qy 601 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 66 0 

Qy 661 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 720 

Qy 721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 780 
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Qy 



781 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRDDDGHYS 840 




Db 



781 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRDDDGHYS 8 40 



Qy 



841 QTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVL 9 0 0 




Db 



841 QTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVL 9 0 0 



Qy 



901 FGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933 




Db 



901 FGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933 



RESULT 2 
AEL84788 

ID AEL84788 standard; protein; 933 AA. 

XX 

AC AEL8 4 78 8; 
XX 

DT 18-OCT-2007 (revised) 

DT 15-JUN-2007 (revised) 

DT 28-DEC-2006 (first entry) 
XX 

DE Tumor marker gene NGEP SEQ ID NO 155. 
XX 

KW cytostatic; diagnosis; prognosis; tumor marker; gene expression; 

KW drug screening; cancer; neoplasm; NGEP; BOND_PC; NGEP long variant; 

KW G05886. 
XX 

OS Homo sapiens . 
XX 

PN WO2006110593-A2 . 
XX 

PD 19-OCT-2006. 

XX 

PF 07-APR-2006; 2006WO-US013172 . 

XX 

PR 07-APR-2005; 2005US-0669342P . 

PR ll-OCT-2005; 2005US-0725982P . 

XX 

PA (MACR-) MACROGENICS INC. 

XX 

PI Von Haller PD, Schummer M, Meyer DW, Schubert LA, Tjoelker LW; 

XX 

DR WPI; 2006-814687/82. 

DR N-PSDB; AEL8478 7. 

DR REFSEQ; NP_001001891 . 

DR PCrNCBI; gi48093524. 
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XX 

PT Detecting or diagnosing cancer in a subject comprises determining 

PT expression of at least one gene, and comparing level of expression to a 

PT control sample from a normal subject, where increased expression level 

PT indicates cancer, 

XX 

PS Claim 8; SEQ ID NO 155; 583pp; English. 

XX 

CC The invention describes a method of detecting or diagnosing cancer in a 

CC subject comprising determining the expression level of at least one gene, 

CC and comparing the level of expression to a corresponding control sample 

CC from a normal subject, where cancer is detected or diagnosed if there is 

CC an increase in the expression level of the gene relative to the 

CC expression in the control sample. Also described are: identifying a 

CC compound to be tested for its ability to prevent, treat, manage, or 

CC ameliorate cancer or its symptom; a compound identified by the method; 

CC treating cancer in a patient; treating a cancer in a subject that is 

CC fully or partially refractory to a first treatment in a patient; and a 

CC pharmaceutical composition comprising an amount of an antibody selected 

CC from anti-SLC12A2, anti-FLJ23375, anti-GRM5, anti-TAS2Rl, anti-NRXN2, 

CC anti-C14orf 160, anti-MGC 15668, ant i-MGC33 4 86 , anti-TMEM16F, anti-FAT, 

CC anti-KIAA0195, anti-LRFN, anti-NFASC, anti-BAT2Dl, anti-MGC2963, anti- 

CC KIAA0685, anti-EDG3, anti-GGTL3, anti-PLVAP, anti-FLJ31528 , anti- 

CC FLJ90709, anti-VEZATIN, anti-TMPRSS9 , anti-ATP13A5 , anti-PKHDlLl , anti- 

CC C2orfl8, anti-ANKRD22, anti-FAM62B, anti-L0C5 716 8 , anti-CDKALl, anti- 

CC SLC39A3vl, anti-SLC39A3v2, anti-BAT5, anti-TM9SF4, anti-DC2, anti-VAPB, 

CC anti-XTP3TPB, ant i-TACSTD2 , anti-FNDC3A, anti-GKOOl, anti-0CIAD2, anti- 

CC PR01855, anti-C20orf3, anti-SDFRl, anti-FLJ20481, anti-LENG4, anti- 

CC FLJ12443, anti-ARP5 Long, anti-ARP5 Short, anti-TMD06 45 , anti-NGEP, anti- 

CC ILIRAPI, anti-PLXNBl, anti-ATP2B2, anti~FLJ11848, anti-ENTPD2, anti- 

CC PPMIH, anti-KRTKAP3, anti-KCNC3, anti-TM9SFl, anti-ULBPl, ant i-Cl 9orf 26 , 

CC anti-KIAA830, anti-KIAA1244, anti-KIAA1797, anti-MGC26856, anti-NET02, 

CC anti-SUSD2, anti-F0LR2, anti-EMR2, ENTPDl, anti-ATPlOB, anti-PTK7, anti- 

CC FLJ14681, anti-C20orf22, anti-FLJ14281, anti-FAM8Al, anti-TMED7, anti- 

CC C20orfl08, anti-ATADl, anti-GPR154, anti-C14orf 27, anti-OSAP, anti- 

CC FAD104, anti-FLJ90492, anti-SLC27A3, anti-RON, ant i-ATP13Al , anti- 

CC DKFZP564D166, anti-ESSPL, anti-EXTL3, anti-KAIl, anti-KIAA0960, anti- 

CC MTRNL, anti-SLC27Al, anti-GRIA, anti-0R4Ml, anti-KIAA16 79, or anti-UPK-lb 

CC antibody, and a pharmaceutical carrier. The methods are useful for 

CC detecting, diagnosing, and treating cancer, e.g. colon, lung, ovary, 

CC prostate, pancreas, or bladder cancer. This is the amino acid sequence of 

CC NGEP, altered levels of expression are useful in the diagnosis or 

CC prognosis of cancer. 

CC 

CC Revised record issued on 18-OCT-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 933 AA; 
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Query Match 100.0%; Score 4950; DB 11; Length 933; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 933; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCARSRMLRRRA 6 0 

I I I M I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCARSRMLRRRA 6 0 

Qy 61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDL 120 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDL 12 0 

Qy 121 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 18 0 

Qy 181 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRF 2 40 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLPRF 2 40 

Qy 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300 

Qy 301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 36 0 

Qy 361 AWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 AVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 42 0 

Qy 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 48 0 

Qy 481 TGEDEPYFPERSRARRMLAGSVVIWMVAVVVMCLVSIILYRAIMAIWSRSGNTLLAAW 5 40 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 481 TGEDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAW 540 

Qy 541 ASRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 541 ASRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 600 

Qy 601 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M 
Db 601 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 66 0 

Qy 661 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 72 0 
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Db 



661 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 72 0 



Qy 



721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 78 0 




Db 



721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 78 0 



Qy 



781 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRDDDGHYS 8 40 




Db 



781 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRDDDGHYS 840 



Qy 



841 QTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVL 90 0 




Db 



841 QTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVL 90 0 



Qy 



901 FGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933 




Db 



901 FGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933 



RESULT 3 

AEB13426 

ID AEB13426 standard; protein; 885 AA. 
XX 

AC AEB13426; 
XX 

DT 22-SEP-2005 (first entry) 
XX 

DE Human prostate specific polypeptide #2. 
XX 

KW Screening; diagnosis; drug delivery; prostate specific polypeptide; 

KW cancer; prostate tumor; cytostatic; neoplasm. 

XX 

OS Homo sapiens . 
XX 

PN WO2005062788-A2 . 
XX 

PD 14-JUL-2005. 
XX 

PF 16-DEC-2004; 2004WO-US042406 . 

XX 

PR 22-DEC-2003; 2003US-0531809P . 

XX 

PA (AVAL-) AVALON PHARM INC. 

XX 

PI Weigle B, Ebner R; 
XX 

DR WPI; 2005-497793/50. 

DR N-PSDB; AEB13425. 
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XX 

PT Novel isolated prostate specific polypeptide, useful for treating cancer, 

PT and identifying agent that modulates activity of cancer related gene. 

XX 

PS Claim 12; SEQ ID NO 5; 59pp; English. 

XX 

CC The invention relates to an isolated prostate specific polypeptide 

CC comprising one or more immunogenic fragments. The invention also relates 

CC to a method of identifying an agent that modulates the activity of a 

CC cancer related gene involving contacting a compound with a cell 

CC containing a gene under conditions promoting the expression of the gene, 

CC detecting a difference in expression of the gene relative to when the 

CC compound is not present and identifying an agent that modulates the 

CC activity of a cancer related gene, a method of identifying an anti- 

CC neoplastic agent involving contacting a cell exhibiting neoplastic 

CC activity with a compound first identified as a cancer related gene 

CC modulator using and determining a decrease in neoplastic activity after 

CC contacting, when compared to when the contacting does not occur, or 

CC administering an agent first identified to an animal exhibiting a cancer 

CC condition and detecting a decrease in cancerous condition, a method of 

CC determining the cancerous status of a cell involving determining an 

CC increase in the level of expression in a cell of a gene where an elevated 

CC expression relative to a known non-cancerous cell indicates a cancerous 

CC state or potentially cancerous state, an antibody that reacts with a 

CC prostate specific polypeptide, an immunocon jugate comprising the antibody 

CC and a cytotoxic agent, a method of treating cancer involving contacting a 

CC cancerous cell in vivo with an agent having activity against a prostate 

CC specific polypeptide and an immunogenic composition the prostate specific 

CC polypeptide. The prostate specific polypeptide is useful for identifying 

CC an agent that modulates the activity of a cancer related gene. The 

CC immunogenic composition is useful for treating cancer, preferably 

CC prostate cancer in an animal, e.g. human, which involves administering 

CC the immunogenic composition that is sufficient to elicit the production 

CC of cytotoxic T lymphocytes specific for the prostate specific 

CC polypeptide. The invention is useful for identifying anti-neoplastic 

CC agents. This sequence represents a human prostate specific polypeptide of 

CC the invention. 

XX 

SQ Sequence 885 AA; 

Query Match 91.5%; Score 4531.5; DB 10; Length 885; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 855; Conservative 0; Mismatches 0; Indels 3; Gaps 2; 
Qy 1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCARSRMLRRRA 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 5 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCA— RMLRRRA 6 2 

Qy 61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQATy^CRAGSPAKPRIADFVLVWEEDL 12 0 
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I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I 
Db 63 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRI-DFVLVWEEDL 121 

Qy 121 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 2 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 181 

Qy 181 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLPRF 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 182 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLPRF 241 

Qy 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 242 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 301 

Qy 301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 302 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 361 

Qy 361 AVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 362 AWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 421 

Qy 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 422 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 481 

Qy 481 TGEDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAW 5 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 482 TGEDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAW 541 

Qy 541 ASRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 6 00 

M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 542 ASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 6 01 

Qy 601 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 66 0 

I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 602 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 661 

Qy 661 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 662 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 721 

Qy 721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 780 

I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 722 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 781 

Qy 781 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRDDDGHYS 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 



782 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRDDDGHYS 841 



Qy 



841 QTYWNLLAIRLAFVIVFE 85 8 




Db 



842 QTYWNLLAIRLAFVIVFE 859 



RESULT 4 
AEB13424 

ID AEB13424 standard; protein; 843 AA. 

XX 

AC AEB13424; 
XX 

DT 22-SEP-2005 (first entry) 
XX 

DE Human prostate specific polypeptide #1. 
XX 

KW Screening; diagnosis; drug delivery; prostate specific polypeptide; 

KW cancer; prostate tumor; cytostatic; neoplasm. 

XX 

OS Homo sapiens . 
XX 

FN WO2005062788-A2 . 
XX 

PD 14-JUL-2005. 

XX 

PF 16-DEC-2004; 2004WO-US042406 . 
XX 

PR 22-DEC-2003; 2003US-0531809P . 
XX 

PA (AVAL-) AVALON PHARM INC. 
XX 

PI Weigle B, Ebner R; 
XX 

DR WPI; 2005-497793/50. 

DR N-PSDB; AEB13423. 

XX 

PT Novel isolated prostate specific polypeptide, useful for treating cancer, 

PT and identifying agent that modulates activity of cancer related gene. 

XX 

PS Claim 12; SEQ ID NO 3; 59pp; English. 

XX 

CC The invention relates to an isolated prostate specific polypeptide 

CC comprising one or more immunogenic fragments. The invention also relates 

CC to a method of identifying an agent that modulates the activity of a 

CC cancer related gene involving contacting a compound with a cell 

CC containing a gene under conditions promoting the expression of the gene, 

CC detecting a difference in expression of the gene relative to when the 

CC compound is not present and identifying an agent that modulates the 
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CC activity of a cancer related gene, a method of identifying an anti- 

CC neoplastic agent involving contacting a cell exhibiting neoplastic 

CC activity with a compound first identified as a cancer related gene 

CC modulator using and determining a decrease in neoplastic activity after 

CC contacting, when compared to when the contacting does not occur, or 

CC administering an agent first identified to an animal exhibiting a cancer 

CC condition and detecting a decrease in cancerous condition, a method of 

CC determining the cancerous status of a cell involving determining an 

CC increase in the level of expression in a cell of a gene where an elevated 

CC expression relative to a known non-cancerous cell indicates a cancerous 

CC state or potentially cancerous state, an antibody that reacts with a 

CC prostate specific polypeptide, an immunocon jugate comprising the antibody 

CC and a cytotoxic agent, a method of treating cancer involving contacting a 

CC cancerous cell in vivo with an agent having activity against a prostate 

CC specific polypeptide and an immunogenic composition the prostate specific 

CC polypeptide. The prostate specific polypeptide is useful for identifying 

CC an agent that modulates the activity of a cancer related gene. The 

CC immunogenic composition is useful for treating cancer, preferably 

CC prostate cancer in an animal, e.g. human, which involves administering 

CC the immunogenic composition that is sufficient to elicit the production 

CC of cytotoxic T lymphocytes specific for the prostate specific 

CC polypeptide. The invention is useful for identifying ant i-neoplastic 

CC agents. This sequence represents a human prostate specific polypeptide of 

CC the invention. 

XX 

SQ Sequence 843 AA; 

Query Match 88.2%; Score 4364.5; DB 10; Length 843; 

Best Local Similarity 99.6%; Pred. No. 0; 

Matches 824; Conservative 0; Mismatches 0; Indels 3; Gaps 2; 

Qy 1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCARSRMLRRRA 6 0 

M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 5 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCA— RMLRRRA 6 2 

Qy 61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDL 120 

I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 63 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRI-DFVLVWEEDL 121 

Qy 121 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 122 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 181 

Qy 181 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRF 2 40 

I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I 
Db 182 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLPRF 241 

Qy 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 242 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 301 



Qy 301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 302 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 361 

Qy 361 AWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I 

Db 36 2 AVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 421 

Qy 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 422 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 481 

Qy 481 TGEDEPYFPERSRARRMLAGSVVIWMVAVVVMCLVSIILYRAIMAIWSRSGNTLLAAW 5 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 482 TGEDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAW 541 

Qy 541 ASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 600 

I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 542 ASRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 601 

Qy 601 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 602 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 661 

Qy 661 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 662 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 721 

Qy 721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 78 0 

I I I I I I I I I I I I I I I I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 72 2 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 781 

Qy 781 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTC 827 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 782 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTC 828 



RESULT 5 
ABG15488 

ID ABG15488 standard; protein; 898 AA. 

XX 

AC ABG15488; 

XX 

DT 18-FEB-2002 (first entry) 
XX 

DE Novel human diagnostic protein #15479. 
XX 
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KW Human; chromosome mapping; gene mapping; gene therapy; forensic; 

KW food supplement; medical imaging; diagnostic; genetic disorder. 
XX 

OS Homo sapiens . 
XX 

PN WO200175067-A2 . 
XX 

PD ll-OCT-2001. 

XX 

PF 30-MAR-2001; 2001WO-US008631 . 

XX 

PR 31-MAR-2000; 2000US-00540217 . 

PR 23-AUG-2000; 2000US-00649167 . 

XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Drmanac RT, Liu C, Tang YT; 
XX 

DR WPI; 2001-639362/73. 

DR N-PSDB; AAS79675. 
XX 

PT New isolated polynucleotide and encoded polypeptides, useful in 

PT diagnostics, forensics, gene mapping, identification of mutations 

PT responsible for genetic disorders or other traits and to assess 

PT biodiversity. 
XX 

PS Claim 20; SEQ ID NO 45847; 103pp; English. 

XX 

CC The invention relates to isolated polynucleotide (I) and polypeptide (II) 

CC sequences. (I) is useful as hybridisation probes, polymerase chain 

CC reaction (PCR) primers, oligomers, and for chromosome and gene mapping, 

CC and in recombinant production of (II). The polynucleotides are also used 

CC in diagnostics as expressed sequence tags for identifying expressed 

CC genes. (I) is useful in gene therapy techniques to restore normal 

CC activity of (II) or to treat disease states involving (II) . (II) is 

CC useful for generating antibodies against it, detecting or quantitating a 

CC polypeptide in tissue, as molecular weight markers and as a food 

CC supplement. (II) and its binding partners are useful in medical imaging 

CC of sites expressing (II) . (I) and (II) are useful for treating disorders 

CC involving aberrant protein expression or biological activity. The 

CC polypeptide and polynucleotide sequences have applications in 

CC diagnostics, forensics, gene mapping, identification of mutations 

CC responsible for genetic disorders or other traits to assess biodiversity 

CC and to produce other types of data and products dependent on DNA and 

CC amino acid sequences. ABG00010-ABG30377 represent novel human diagnostic 

CC amino acid sequences of the invention. Note: The sequence data for this 

CC patent did not appear in the printed specification, but was obtained in 

CC electronic format directly from WIPO at 

CC ftp . wipo . int/pub/published_pct_sequences 
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Sequence 898 AA; 



Query Match 75.5%; Score 3 736; DB 4; Length 89 8; 

Best Local Similarity 82.3%; Pred. No. 0; 

Matches 72 7; Conservative 4; Mismatches 16; Indels 136; Gaps 6; 

Qy 1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAER 3 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERATDWLLAPFCQPKTRSHGTCPP 6 0 

Qy 38 W AMTSETS SG 4 7 

I I : I I I 

Db 61 TERDPRGEGSTEYPGRVDGIQGWGTRALTGWTDRRLLCQACQTLPPRHWFLPGARGWLGG 12 0 

Qy 4 8 SHCA RSRMLRRRAQEEDSTVLIDVSPPEAEKRGSYGSTAH 8 7 

III : I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 SPCAHGQESLPSQPSPILLRVESVKSRMLRRRAQEEDSTVLIDVSPPEAEKRGSYGSTAH 180 

Qy 88 ASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDN 147 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I 
Db 181 ASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDN 240 

Qy 148 LRAAGLCVDQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLL 20 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : 
Db 241 LRAAGLCVDQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQDYPTRPPTGRPACC 300 

Qy 20 8 AWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTP 26 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 AWLGIPNVLLEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTP 36 0 

Qy 26 8 YGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWN 32 7 

M I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWN 42 0 

Qy 32 8 KYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGS 38 7 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 KYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGS 48 0 

Qy 388 KDSFEMCPLCLDCPFWLLSSACALAQ AGRLFDHGGTVFFSLFMALWAVLLLEYWKR 443 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 KDSFEMCPLCLDCPFWLLSSACALAQVREEAGRLFDHGGTVFFSLFMALWAVLLLEYWKR 5 40 

Qy 444 KSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVV 5 03 

I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 KSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVV 6 00 

Qy 504 IWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKI 563 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 601 IVVMVAWVMCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKI 660 



Qy 



56 4 YVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTL 623 




Db 



661 YVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTL 72 0 



Qy 



624 FGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGA 6 83 




Db 



721 FGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGA 78 0 



Db 



Qy 




Qy 



712 VLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKF 747 




Db 



841 QDSRPEAVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKF 883 



RESULT 6 
ADB64420 

ID ADB64420 standard; protein; 920 AA. 

XX 

AC ADB6 4420; 
XX 

DT 15-JUN-2007 (revised) 

DT 04-DEC-2003 (first entry) 

XX 

DE Human protein encoded by clone FEBRA20031280 . 
XX 

KW Human; pharmaceutical; diagnostic; gene therapy; tissue regeneration; 

KW cell regeneration; membrane protein; signal transduct ion-related protein; 

KW transcription-related protein; osteoporosis; neurological disease; 

KW cancer; tumour; BOND_PC; transmembrane protein 160; 

KW transmembrane protein 160 (eight membrane-spanning domains); 

KW transmembrane protein 160 [Homo sapiens]; TMEM160; FLJ34221; FLJ34272; 

KW FLJ35277; MGC130026; unnamed protein product; 

KW unnamed protein product [Homo sapiens] . 

XX 

OS Homo sapiens . 

XX 

PN EP1308459-A2 . 

XX 

PD 07-MAY-2003. 

XX 

PF 28-MAR-2002; 2002EP-00007401 . 
XX 

PR 05-NOV-2001; 2001JP-00379298 . 

PR 25-JAN-2002; 2002US-0350978P . 
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XX 

PA (HELI-) HELIX RES INST. 

PA (REAS-) RES ASSOC BIOTECHNOLOGY. 

XX 

PI Isogai T, Sugiyama T, Otsuki T, Wakamatsu A, Sato H, Ishii S; 

PI Yamamoto J, Isono Y, Hio Y, Otsuka K, Nagai K, Trie R, Tamechika I; 

PI Seki N, Yoshikawa T, Otsuka M, Nagahari K, Masuho Y; 

XX 

DR WPI; 2003-450961/43. 

DR N-PSDB; ADB62450. 

DR PC:NCBI; gi30520318. 
XX 

PT New polynucleotides and polypeptides, useful for developing a diagnostic 

PT marker or medicines for regulation of their expression and activity, or 

PT as targets of gene therapy. 
XX 

PS Claim 1; Page; 222pp; English. 

XX 

CC The invention discloses a polynucleotide comprising a sequence selected 

CC from 19 70 fully defined nucleotide sequences which encode novel 

CC polypeptides. Also claimed is a polypeptide encoded by the polynucleotide 

CC or its partial peptide, an antibody binding to the polypeptide or peptide 

CC of the polynucleotide, immunologically assaying the polypeptide or 

CC peptide of the polynucleotide by contacting the polypeptide or peptide 

CC with the antibody of the encoded protein, and observing the binding 

CC between the two, a transformant carrying the polynucleotide in an 

CC expressible manner and an antisense polynucleotide. The oligonucleotide 

CC is useful as a primer for synthesising the polynucleotide, or as a probe 

CC for detecting the polynucleotide. The polynucleotides and encoded 

CC proteins are useful as pharmaceutical agents and many disease-related 

CC genes may be included in them, for developing a diagnostic marker or 

CC medicines for regulation of their expression and activity, or as targets 

CC of gene therapy. The genes are involved in tissue and/or cell 

CC regeneration. Membrane proteins, signal transduction-related proteins, 

CC transcription-related proteins, disease-related proteins and genes 

CC encoding them can be used as indicators for diseases (e.g. osteoporosis, 

CC neurological diseases, cancer, tumours. The cDNA may be used to regulate 

CC the activity or expression of the encoded protein to treat diseases. The 

CC sequence presented is a protein of the invention. Note: Some of the 

CC sequence data for this patent is not represented in the printed 

CC specification, but is based on sequence information supplied by the 

CC European Patent Office. 

CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 
XX 

SQ Sequence 920 AA; 

Query Match 30.9%; Score 1531.5; DB 6; Length 920; 
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Best Local Similarity 37.9%; Pred. No. l.le-147; 

Matches 360; Conservative 168; Mismatches 316; Indels 105; Gaps 29; 

Qy 44 TSSGSHCARSRMLRRRAQEEDSTVLID VSPPEAE KRGSYGST AHASEP 91 

: I I I : : : : I : I : I I III: I : I I I I 

Db 4 SSSGITNGKTKVFHPVA — KDVNILFDELEAVSSPCKDDDSLLHPGNLTSTSDDASRLEA 61 

Qy 92 GGQQAAACRAGS PAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLD 146 

II: : I I I I : : I I : : : : I : III 
Db 62 GGETVPERNKSNGLYFRDGKCRI-DYILVYRK SNPQTEK REVFER 105 

Qy 14 7 NLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNW 202 

I : I I I I : : : : I : : : I I I I I I I I : : : : I : I I : 

Db 106 NIRAEGLQMEKESSLI-NSDIIFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFM 16 4 

Qy 203 S AGLLAWLGIPNVLL — EWPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTST 253 

I : I II : I I : I I : : I : I : : I : I : : I I I : 

Db 165 SRIDKQISRLRRWLPKKPMRLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKETFFNNA 223 

Qy 254 KRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQR 313 

1:1: II: I I I I : I : : : I I I | | | | | | : | : : : II 
Db 224 TRSRIVHHILQRIKY-EEGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHR 282 

Qy 314 QVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCF 3 73 

: I : : II II I I I I I I I I I I I I I I I : I I I I I I I : I I I I III : I III I 

Db 283 HLLYECWASWGVWYKYQPLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVT 342 

Qy 3 74 LVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMAL 432 

: : : I : I : I I I I : I III I I : I I : I I I : I I I I I : : I I I : 

Db 343 TLDHSQVSKEVCQATDII-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAV 401 

Qy 433 WAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPER 491 

I I : I I : I I I : I : I I I I I : I : I I I I I I I : I I I : I : I I I 

Db 402 WATVFLEFWKRRRAVIAYDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFT 461 

Qy 492 SRARRMLAGSVVIVVMVAVWMCLVSIILYRAIMAIWSRSGNTLLA-AWA SRIA 545 

: I : : : I I : I I : : I : : I I : : I I II I : : I 

Db 46 2 DKCSRLIVSASGIFFMICWIAAVFGIVIYRWTV STFAAFKWALIRNNSQVA 514 

Qy 546 SLTGSW— NLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSP 6 03 

: I I : I I I : : I : : I : I : I I I I I : : : : I : : I I I I : I : I I I I I II 
Db 515 T-TGTAVCINFCIIMLLNVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSST 5 73 

Qy 604 VYIAFFKGRFVGYPGNYHTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLI 662 

111111111:111 I I I I I I I I I : I : : : I I I I I II I : 

Db 574 FYIAFFLGRFTGHPGAYLRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGY 633 

Qy 663 PKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTI 72 0 

I : : I I : I : : : I I I I I I I I I I I I I I I I I : I I I I I I I 
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Db 63 4 PLIQNWWTR RKVRQEHGPERKISFPQWEKDYNLQPMNAYGLFDEYLEMILQFGFTTI 6 90 



Qy 



721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 78 0 



Db 



691 FVAAFPLAPLLALLNNIIEIRLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGILSVITN 75 0 



Db 



Qy 



781 AFLLAFSSDFLPRAYYRW TRAHDLRGFLNFTLA RAP 816 

I I : : I : I I I : I I I : : | : : | : | : 

751 AFVIAITSDFIPRLVYAYKYGPCAGQGEAGQKCMVGYVNASLSVFRI SDFENRSEPESDG 810 



Qy 



817 SSFAAAHNRTCRYRAFRDDDGH YSQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLL 8 71 




Db 



811 SEFSGTPLKYCRYRDYRDPPHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIKHLISYL 8 70 



Qy 



872 VPDIPESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHW 920 



Db 



871 IPDLPKDLRDRMRREKYLIQEMMYEAELERLQKERKERKKNGKAHHNEW 919 



RESULT 7 
ABP58666 

ID ABP58666 standard; protein; 920 AA. 

XX 

AC ABP58666; 
XX 

DT 24-MAR-2003 (first entry) 
XX 

DE Human dihydropyrimidinase related protein 1-101.20. 
XX 

KW Human; dihydropyrimidinase related protein 1-101.20; 

KW recombinant production; gene therapy; psychosis; development disorder; 

KW uracil-related metabolic disorder; thymine-related metabolic disorder; 

KW pyrimidine metabolic disorder. 
XX 

OS Homo sapiens . 

XX 

PN CN1364894-A. 

XX 

PD 21-AUG-2002. 

XX 

PF lO-JAN-2001; 2001CN-00105195 . 
XX 

PR lO-JAN-2001; 2001CN-00105195 . 
XX 

PA (BIOW-) BIOWINDOW GENE DEV INC SHANGHAI. 
XX 

PI Mao Y, Xie Y; 

XX 

DR WPI; 2003-000532/01. 
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DR N-PSDB; ABZ57080. 
XX 

PT New polypeptide-human dihydropyrimidinase relative protein 1-101, 20 and 

FT polynucleotide for encoding such polypeptide. 

XX 

PS Claim 1; Page 28-30 (Disclosure); 36pp; Chinese. 
XX 

CC The invention relates to human dihydropyrimidinase related protein 1- 

CC 101.20 (ABP58666) and nucleic acids encoding it (ABZ57080). The protein 

CC has a molecular weight of 101.2 kD . The invention also relates to a 

CC method for the recombinant production of the protein, an antagonist of 

CC the protein, and the use of the protein, gene and antagonist in 

CC therapeutic applications. Dihydropyrimidinase related protein 1-101.20 

CC can be used in the treatment of a variety of diseases such as psychosis, 

CC development disorders and uracil- and thymine-related metabolic 

CC disorders. The present sequence represents human dihydropyrimidinase 

CC related protein 1-101.20 

XX 

SQ Sequence 920 AA; 

Query Match 30.5%; Score 1511.5; DB 6; Length 920; 

Best Local Similarity 37.6%; Pred . No. 1.2e-145; 

Matches 357; Conservative 169; Mismatches 318; Indels 105; Gaps 29; 

Qy 44 TSSGSHCARSRMLRRRAQEEDSTVLID VSPPEAE KRGSYGST AHASEP 91 

: I I I : : : : I : I : I I III: I : I I I I 

Db 4 SSSGITNGKTKVFHPVA— KDVNILFDELEAVSSPCKDDDSLLHPGNLTSTSDDASRLEA 61 

Qy 92 GGQQAAACRAGS PAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLD 146 

II: : I I I I : : I I : : : : I : III 

Db 6 2 GGETVPERNKSNGLYFRDGKCRI-DYILVYRK SNPQTEK REVFER 105 

Qy 14 7 NLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNW 2 02 

I : I I I I : : : : I : : : I I I I I I I I : : : : I : I I : 

Db 106 NIRAEGLQMEKESSLI-NSDIIFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFM 16 4 

Qy 203 S AGLLAWLGIPNVLL — EWPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTST 253 

I : II : I I : I I : : I : I : : I : I : : I II : 

Db 165 SRIDKQISRFRRWLPKKPMRLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKETFFNNA 223 

Qy 25 4 KRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQR 313 

I : I : II: I I M : I : : : I I I I I I I I I : I : : : II 

Db 224 TRSRIVHHILQRIKY-EEGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHR 282 

Qy 314 QVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCF 3 73 

: I : : II II I I I I I I I I I I I I I I I : I I I I I I I : I I I I III : I III I 
Db 283 HLLYECWASWGVWYKYQPLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVT 3 42 

Qy 374 LVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMAL 432 



http://es/ScoreAccessWeb/Getltem.action?Appld=10552...4_083145_us-10-552-515-l.rag&ltemType==4&startByte=0 (22 of 44)10/10/200 8:48:10 AM 



SCORE Search Results Details for Application 10552515 and Search Result 20080624_083145_us-10-552-515-l.rag. 



: : : I : I : I I I I : I III I I : I I : j jj : | | | | | : : | | | : 

Db 343 TLDHSQVSKEVCQATDII-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAV 401 

Qy 433 WAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPER 491 

I I : I I : I I I : I : I I I I I : I : I I I I I I I : I I I : I : I I I 

Db 402 WATVFLEFWKRRRAVIAYDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFT 461 

Qy 492 SRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLA-AWA SRIA 545 

: I : : : I I : I I : : I : : I I : : I I II I : : I 

Db 462 DKCSRLIVSASGIFFMICWIAAVFGIVIYRWTV STFAAFKWALIRNNSQVA 514 

Qy 546 SLTGSW— NLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSP 6 03 

: I I : I I I : : I : : I : I : I I I I I : : : : I : : I I I I : I : I I I II II 

Db 515 T-TGTAVCINFCIIMLLNVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSST 5 73 

Qy 604 VYIAFFKGRFVGYPGNYHTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLI 662 

111111111:111 I I I I I I I I I : I : : : I I I I I II I : 

Db 574 FYIAFFLGRFTGHPGAYLRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGY 633 

Qy 663 PKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTI 72 0 

I : : I I : I : : : I I I I I I I I I I : I I I I II : I II II II 

Db 634 PLIQNWWTR RKVRQEHGPERKISFPQWEKDYNLQPMNAYGLYDEYLEMILQFGFTTI 690 

Qy 721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 780 

I I I I I I I I I I I I I I : I I I I I I III : : I I I : I I I : I I I I : II I : I : I I : I 

Db 691 FVAAFPLAPLLALLNNIIEIRLDAYKFVTQWRRPLASRAKDIGIGYGILEGIGILSVITN 750 

Qy 781 AFLLAFSSDFLPRAYYRW TRAHDLRGFLNFTLA RAP 816 

I I : : I : I I I : I I I : : I : : I : I : 

Db 751 AFVIAITSDFIPRLVYAYKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDG 810 

Qy 817 SSFAAAHNRTCRYRAFRDDDGH YSQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLL 871 

II: : I I I I : I I I : : I : : I I I I I I : I I I I I : I I : I : I 

Db 811 SEFSGTPLKYCRYRDYRDPPHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIKHLISYL 8 70 

Qy 8 72 VPDIPESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHW 92 0 

: I I : I : : : : : I I I I : : : I I : I : : I : I 

Db 871 IPDLPKDLRDRMRREKYLIQEMMYEAELERLQKERKERKKNGKAHHNEW 919 



RESULT 8 
ADK52114 

ID ADK52114 standard; protein; 981 AA. 

XX 

AC ADK52114; 
XX 

DT 15-JUN-2007 (revised) 

DT 20-MAY-2004 (first entry) 

XX 
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DE Human atopic dermatitis/psoriasis-associated protein #29. 
XX 

KW Human; atopic dermatitis; psoriasis; dermatological; anti-inflammatory; 

KW antipsoriatic; rash; BOND_PC; transmembrane protein 16C; 

KW chromosome 11 open reading frame 25; 

KW transmembrane protein 16C [Homo sapiens]; TMEM16C; Cllorf25; GENX-3947; 

KW transmembrane protein 16C (eight membrane-spanning domains); 

KW hypothetical protein; hypothetical protein [Homo sapiens]; GO16020; 

KW GO16021; G04185; GO7001. 

XX 

OS Homo sapiens. 
XX 

PN WO2004016785-A1 . 

XX 

PD 26-FEB-2004. 
XX 

PF 06-AUG-2003; 2003WO-JP009999 . 
XX 

PR 06-AUG-2002; 2002 JP-00229319 . 

PR 14-MAY-2003; 2003 JP-00136544 . 
XX 

PA (GENO-) GENOX RES INC. 

PA (UYJU-) UNIV JUNTENDO. 
XX 

PI Itoh M, Ogawa K, Shinagawa A, Sudo H, Ogawa H, Ra C; 

PI Mitsuishi K; 

XX 

DR WPI; 2004-214514/20. 

DR N-PSDB; ADK52028. 

DR PC:NCBI; gil3899227. 

DR PC:SWISSPROT; Q9BYT9 . 
XX 

PT Detecting atopic dermatitis or psoriasis comprises assaying levels of 

PT expression of an indicator gene at a rash site and non-rash site of a 

PT person with atopic dermatitis or psoriasis. 

XX 

PS Example 2; SEQ ID NO 147; 484pp; Japanese. 

XX 

CC The invention relates to detecting atopic dermatitis or psoriasis 

CC comprising assaying the levels of expression of an indicator gene at a 

CC rash site and non-rash site of a person with atopic dermatitis or 

CC psoriasis, comparing these levels with those of a healthy person, and 

CC determining that if the levels of indicators are higher or lower, then 

CC this indicates the disease. Also included are a reagent for detecting 

CC atopic dermatitis or psoriasis, a k±t for screening for treatments, a 

CC transgenic non human vertebrate animal models for the diseases, an agent 

CC for inducing the diseases in mice and a DNA chip for assaying for the 

CC indicator genes. The method is used for treatment, detection and animal 

CC models for research of atopic dermatitis and psoriasis. The present 
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CC sequence is a protein encoded by an indicator gene of the invention. 
CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 981 AA; 

Query Match 30.4%; Score 1504; DB 8; Length 981; 

Best Local Similarity 39.4%; Pred. No. 8.1e-145; 

Matches 329; Conservative 163; Mismatches 268; Indels 76; Gaps 24; 

Qy 106 KPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNT 16 5 

I I I I : : I I : : I : : III I I I I I I : : : : : 

Db 161 KRRI-DYILVYR KTNIPYDKRNTFEKNLRAEGLMLEKEPA-IASP 2 03 

Qy 16 6 TVHYALLSASWAVLCYYAEDLRLKLPLQ ELPNQASNWSAGLLAWLGIPNV 215 

: : : I I I I I I I : : : I : : : : : I : : 

Db 204 DIMFIKIHIPWDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKDWMAQNPM 263 

Qy 216 LLE — VVPDV-PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — GH 2 70 

: I : II: : I : I : : I : : I : I I I I : : I : I : : : I : I I I 

Db 26 4 VLDKSAFPDLEESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGI 322 

Qy 271 EKKNLLGIHQLLAEGVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKW 326 

I : I I : I : I I I I I I : I : I : III I I : I :: I I I I I I 

Db 323 SK VGIRKLINNGSYIAAFPPHEGAYKSSQPIKTHGPQ NNRHLLYERWARWGMW 3 75 

Qy 327 NKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCG 386 

I : I I I I : I I I I I I : I I I I I I I : I I I I : I I I : I I II II:: : I I : I 
Db 376 YKHQPLDLIRLYFGEKIGLYFAWLGWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICK 435 

Qy 387 SKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKS 445 

: : I M I I I : I I : : I I : I I I : I I I I I I : : I I I : II : II : I I I : 

Db 436 ATEVF-MCPLCDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRR 494 

Qy 446 ATLAYRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSVVI 504 

: I I II ::!:!! Mill I I II II : II : I : I : I I 

Db 495 SILTYTWDLIEWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGI 554 

Qy 505 WMVAVWMCLVSIILYR-AIMAIWSRSGNTLLAAWASRIASLTGSV-VNLVFILILSK 562 

I : : : I : : : : : I I : I I I : I : I : : I : I : I : : I : 

Db 555 FFMISLVITAVFGVWYRLWMEQFASFKWNFIKQYW — QFATSAAAVCINFI IIMLLNL 612 

Qy 563 lYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHT 622 

I : I : : M I I I : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : I I I : 
Db 613 AYEKIAYLLTNLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNK 6 72 

Qy 623 LFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKA 681 

II I I I I 1111:1 : : I I I I I : I I I : I : : I I : : : I 
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Db 6 73 LFDRWRLEECHPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KR 72 7 

Qy 682 GASAGASQGPWEDDYELVP — CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE 73 9 

I II I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I 

Db 728 GIH-DASIPQWENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNIIE 786 

Qy 740 IRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW- 798 

I I I I I III : : I I I : II I I I I I II I : I I I I : I I I : : I : I I : : I I I : 

Db 78 7 IRLDAYKFVTQWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYK 8 46 

Qy 79 9 TRAHDLRGFLNFTLARAP-SSFAAAHNRTCRYRAFR DDDGHYSQTY 8 43 

: I : I : : I : I : I : I I I I : I : : I 

Db 847 YGPCANHVEPSENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQY 906 

Qy 844 WNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEV 899 

I : : I I 1111:11111:11 : : I : I I : I : : : : : I I II : : : I I : 
Db 907 WHILAARLAFIIVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAEL 962 



RESULT 9 
AEN06206 

ID AEN06206 standard; protein; 981 AA. 

XX 

AC AEN06206; 
XX 

DT 15-JUN-2007 (revised) 

DT 22-FEB-2007 (first entry) 

XX 

DE Human esophageal cancer-associated protein SEQ ID NO 231. 
XX 

KW diagnostic; metastasis; esophagus tumor; gastrointestinal disease; 

KW neoplasm; cytostatic; cancer; AXL; ZBTBll; TNFRSF14; NSUN5; SPEN; LTBP3; 

KW SYNGRl; SLC13A1; MAP3K12; GLYAT; ZNF659; B4GALT2; POGK; AQP3; CAPG; 

KW SLIT2; BOND_PC; transmembrane protein 16C; 

KW chromosome 11 open reading frame 25; 

KW transmembrane protein 16C [Homo sapiens]; TMEM16C; Cllorf25; GENX-3947; 

KW transmembrane protein 16C (eight membrane-spanning domains); 

KW hypothetical protein; hypothetical protein [Homo sapiens]; GO16020; 

KW GO16021; G04185; GO7001. 

XX 

OS Homo sapiens . 
XX 

PN WO2006118308-A1 . 
XX 

PD 09-NOV-2006. 
XX 

PF 02-MAY-2006; 2006WO-JP309177 . 
XX 

PR 02-MAY-2005; 2005 JP-00134530 . 
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PR 13-SEP-2005; 2005JP-00265645 . 

PR 13-SEP-2005; 2005 JP-002656 78 . 
XX 

PA (TORA ) TORAY IND INC. 

PA (KYOU ) UNIV KYOTO. 

XX 

PI Akiyama H, Kozono S, Myomoto A, Nomura 0, Nobumasa H, Tanaka Y; 

PI Tomoda S, Shimada Y, Tsujimoto G; 

XX 

DR WPI; 2007-110304/11. 

DR PC:NCBI; gil3899227. 

DR PC:SWISSPROT; Q9BYT9 . 
XX 

PT Composition for determining occurrence/metastasis of esophageal cancer in 

PT subject, comprises an antibody binding to a polypeptide encoded by a 

PT polynucleotide having a sequence of genes e.g. AXL, ZBTBll and TNFRSF14, 

PT and/or polynucleotides. 

XX 

PS Claim 1; SEQ ID NO 231; 142pp; Japanese. 
XX 

CC This invention describes a novel composition for detecting metastasis of 

CC esophageal cancer in a test subject. The composition contains a probe 

CC derived from polynucleotides AXL, ZBTBll, TNFRSF14, NSUN5, SPEN, LTBP3, 

CC SYNGRl, SLC13A1, MAP3K12, GLYAT, ZNF659, B4GALT2, POGK, AQP3, CAPG, 

CC SLIT2, their variants or fragments and an antibody. The invention also 

CC claims: a) a kit for detecting, determining or presuming the occurrence 

CC or metastasis of esophageal cancer in a test subject; b) a DNA chip for 

CC detecting, determining or presuming the occurrence or metastasis of 

CC esophageal cancer and c) a method to detect, determine or presume the 

CC occurrence or metastasis of esophageal cancer in a test subject by 

CC detecting the presence of or amount or expression level of one or more 

CC esophagus-cancer related target nucleic acid in a biological sample. The 

CC method enables the rapid and convenient detection of occurrence or 

CC metastasis of esophageal cancer in test subject with high sensitivity. 

CC This sequence represents a protein used in the method of the invention 

CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 981 AA; 

Query Match 30.4%; Score 1504; DB 12; Length 981; 
Best Local Similarity 39.4%; Pred. No. 8.1e-145; 

Matches 329; Conservative 163; Mismatches 268; Indels 76; Gaps 24; 

Qy 106 KPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNT 16 5 
I I I I : : I I : : I : : III I I I I I I : : : : : 

Db 161 KRRI-DYILVYR KTNIPYDKRNTFEKNLRAEGLMLEKEPA-IASP 203 
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Qy 16 6 TVHYALLSASWAVLCYYAEDLRLKLPLQ ELPNQASNWSAGLLAWLGIPNV 215 

: : : I I I | I I I : : : I : : : : : I : : 

Db 204 DIMFIKIHIPWDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKDWMAQNPM 263 

Qy 216 LLE — VVPDV-PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — GH 2 70 

: I : M : : I : I : : I : : I : I I I I : : | : | : : : | : | | | 

Db 264 VLDKSAFPDLEESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGI 322 

Qy 2 71 EKKNLLGIHQLLAEGVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKW 326 

I : I I : I : I I I I I I : I : I : III | | : | : : | | | | | | 

Db 323 SK VGIRKLINNGSYIAAFPPHEGAYKSSQPIKTHGPQ NNRHLLYERWARWGMW 3 75 

Qy 32 7 NKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCG 386 

I : I I I I : I I I I I I : I I I I I I I : I I I I : I I I : I I II II:: : I I : I 
Db 376 YKHQPLDLIRLYFGEKIGLYFAWLGWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICK 435 

Qy 387 SKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKS 445 

:: I I I I I I : I I : : I I : M I : I I I I I I : : I I I : I I : I I : I I I : 

Db 436 ATEVF-MCPLCDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRR 49 4 

Qy 446 ATLAYRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSVVI 504 

: I I I I : : I : I I I I I I I I I I I I I : I I : I : I : I I 

Db 495 SILTYTWDLIEWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGI 554 

Qy 505 WMVAVWMCLVSIILYR-AIMAIWSRSGNTLLAAWASRIASLTGSV-VNLVFILILSK 562 

I : : : I : : : : : I I : I I I : I : I : : I : I : I : : I : 

Db 555 FFMISLVITAVFGVWYRLWMEQFASFKWNFIKQYW— QFATSAAAVCINFIIIMLLNL 612 

Qy 563 lYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHT 6 22 

I : I : : I I I I I : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : I I I : 

Db 613 AYEKIAYLLTNLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNK 6 72 

Qy 623 LFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKA 681 

II I I I I I I I I : I : : I I I I I : I I I : I : : I I : : : I 
Db 6 73 LFDRWRLEECHPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KR 72 7 

Qy 682 GASAGASQGPWEDDYELVP — CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE 739 

I M I I : I : I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I : I 

Db 728 GIH-DASIPQWENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNIIE 786 

Qy 740 IRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW- 798 

I I I I I III : : I I I : II I I I I I M ! : I I I I : I I I : : I : I I : : I I I : 

Db 78 7 IRLDAYKFVTQWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYK 8 46 

Qy 79 9 TRAHDLRGFLNFTLARAP-SSFAAAHNRTCRYRAFR DDDGHYSQTY 8 43 

: I : I : : I : I : I : I I I I : I : : I 

Db 847 YGPCANHVEPSENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQY 906 

Qy 844 WNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEV 899 
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Db 907 WHILAARLAFIIVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAEL 962 



RESULT 10 
AFB77190 

ID AFB77190 standard; protein; 1017 AA. 
XX 

AC AFB7 719 0; 
XX 

DT 28-JUN-2007 (first entry) 
XX 

DE Mouse TM-1 (Tmeml6a) protein. 

XX 

KW Cell isolation; stem cell; therapeutic; transgenic animal; screening; 

KW tissue regeneration; genitourinary disease; uropathic; 

KW intervertebral disk displacement; degeneration; injury; vulnerary; 

KW back pain; transmembrane factor-1; Tmeml6a. 

XX 

OS Mus musculus. 

XX 

PN WO2007027583-A2 . 
XX 

PD 08-MAR-2007. 

XX 

PF 28-AUG-2006; 2006WO-US033491 . 

XX 

PR 31-AUG-2005; 2005US-0713400P . 
XX 

PA (UYFL ) UNIV FLORIDA RES FOUND INC. 
XX 

PI Harfe BD, Cohn MJ; 
XX 

DR WPI; 2007-412931/39. 

DR N-PSDB; AFB77189. 

XX 

PT Isolating sonic hedgehog expressing-cells comprises obtaining a non-human 

PT transgenic subject in which a marker gene has been inserted into the 

PT subject's genome. 

XX 

PS Disclosure; SEQ ID NO 2; 96pp; English. 

XX 

CC The present invention relates to a method of isolating cells in selected 

CC tissues co-expressing the sonic hedgehog (Shh) gene and a marker gene. 

CC The method involves obtaining a non-human transgenic subject in which a 

CC marker gene has been inserted into the subject's genome and isolating 

CC Shh/marker gene expressing cells and Shh/marker gene non-expressing cells 

CC from the selected tissue. The invention further provides a method of 

CC identifying differentially expressed genes (e.g. transmembrane factors TM 
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CC -1 and TM-2, EST 1437418, Mmu-miR-135a-2 and AP-2 beta) in selected 

CC tissues co-expressing the sonic hedgehog gene and a marker gene. The 

CC invention is useful in tissue engineering, regeneration, reconstruction 

CC and/or repair of tissues and genitourinary system and also in treating 

CC intervertebral disk rupture, degeneration, disease or injury and back 

CC pain. The invention is further useful for generating transgenic animal. 

CC The present sequence is the mouse TM-1 (Tmeml6a) protein. 
XX 

SQ Sequence 1017 AA; 

Query Match 30.1%; Score 1489; DB 12; Length 1017; 

Best Local Similarity 37.5%; Pred. No. 3e-143; 

Matches 361; Conservative 170; Mismatches 303; Indels 128; Gaps 29; 

Qy 26 GLYCRDQAHAERWAMT— SETSSGSHCARSRMLRRRAQEEDSTVLIDVSPPEAEKRGSYG 83 

I I I I I : : : : I I I I I I I : I : I : 

Db 109 GLYFRDGKRKVDYILVYHHKRASG SRTLARRGLQNDMVL GTRS 151 

Qy 8 4 STAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRET 143 

M : : I I I I I : I : I III 
Db 152 VRQDQPLPG— KGSPVDAGSPEVP MDYHEDD KRFRREE 18 7 

Qy 144 FLDNLRAAGLCVDQQDVQDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQAS 200 

: II III:: I : I : I : : I I I I I I I I : I I : I : : : : : 

Db 188 YEGNLLEAGLELE NDEDTKIHGVGFVKIHAPWHVLCREAEFLKLKMPTKKVYHISE 243 

Qy 201 NWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRFLGS DNQDTFFT 251 

: I I I I : I I : : : I : I : I : I : : I : I I 

Db 244 — TRGLLK — TINSVLQKITDPIQPKVAEHRPQTTKRLSYPFSREKQHLFDLTDRDSFFD 299 

Qy 252 STKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLN 311 

I I I : : I I I : I I : : II I I I I I I I I : I I I I I : II I 

Db 300 SKTRSTIVYEILKRTTCTKAKYS-MGITSLLANGVYSAAYPLHDGDY EGDNV-EFN 353 

Qy 312 QRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVG 3 71 

I : : I : : II : I : I I I I : I I I : I I I I I I I I I I I I I II I : I I : : I I : I I I I 
Db 354 DRKLLYEEWASYGVFYKYQPIDLVRKYFGEKVGLYFAWLGAYTQMLIPASIVGVIVFLYG 413 

Qy 372 CFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFM 430 

I I : I I : I : I : : I I I I I I : I : I I I I I I : I I I I : I I I I I : I I 
Db 414 CATVDENIPSMEMCDQRYNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFM 4 73 

Qy 431 ALWAVLLLEYWKRKSATLAYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPIT 481 

I I I I : I : I I I I I I I I I : : I : I I I I : : I I : I I 

Db 474 ALWAATFMEHWKRKQMRLNYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLRKESRNKET 533 

Qy 482 GEDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAWA 5 41 

I : II I I : I : I I : : : I : I I I : : : : : : 

Db 534 — DKVKLTWRDRFPAYFTNLVSIIFMIAVTFAIVLGVIIYRISTAAALAMNSSPSVRSNI 591 



http://es/ScoreAccessWeb/GetItem.action?AppId=10552...4_083145_us-10-552-515-l.rag&ItemType==4&startByte=0 (30 of 44)10/10/2008 8:48:10 AM 



SCORE Search Results Details for Application 10552515 and Search Result 20080624_083145_us-10-552-515-l.rag. 



Qy 542 SRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYS 601 

: I : : I I I I : : I : : I : I | | : I : : I : I I : | | | : : M | | : 

Db 592 RVTVTATAVIINLWIILLDEVYGCIARWLTKIEVPKTEKSFEERLTFKAFLLKFVNSYT 651 

Qy 6 02 SPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQE 659 

1:111111111 I I : I : I I I I I I I I I I : I I : I : I I : I I I : I II: I 
Db 652 PIFYVAFFKGRFVGRPGDYVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLFE 711 

Qy 66 0 VLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVT 719 

: I M : I : : : I I : : : I I : I I II I I : I I :: I I I I I I 

Db 712 IGIPKMKKFIRYLKLRRQSPSDREEYVKRKQRYEVDFNLEPFAGLTPEYMEMIIQFGFVT 771 

Qy 72 0 IFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVIS 7 79 

: I I I : I I I I I I I I I I I : M I I I I : I I I I I I I I I II : I I I I I :: I I I : I I I I 
Db 7 72 LFVASFPLAPLFALLNNIIEIRLDAKKFVTELRRPVAIRAKDIGIWYNILRGVGKLAVII 831 

Qy 780 NAFLLAFSSDFLPRAYYRWTRAHD— LRGFLNFTLARAPSSF AAAHN 82 4 

I I I : : : I : I I I : I I I : : : : I I : I I I III II 

Db 832 NAFVISFTSDFIPRLVYLYMYSQNGTMHGFVNHTL SSFNVSDFQNGTAPNDPLDLG 88 7 

Qy 825 RTCRYRAFRD DDGHY — SQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIP 8 76 

: I II : : I : : I I : : I : I I I I I I I I I I : : : I : : I : : I I I I 

Db 888 YEVQICRYKDYREPPWSEHKYDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDIP 947 

Qy 877 ESVEIKVKREYYL AKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVP 926 

: : : : : I I I I I : : : I : I : : : I : I I I 

Db 948 KDISQQIHKEKVLMVELFMREEQGKQQLLDTWM EKEKPRDVPCNNH-SPTTHP 999 

Qy 927 KA 928 

: I 

Db 1000 EA 1001 



RESULT 11 
AEG11142 



ID AEG11142 standard; protein; 960 AA. 

XX 

AC AEG11142; 
XX 

DT 15-JUN-2007 (revised) 

DT 20-APR-2006 (first entry) 

XX 

DE Human transmembrane protein 16A, SEQ ID NO: 7. 

XX 

KW Genetic marker; diagnostic; prognosis; gastrointestinal tumor; 

KW cytostatic; neoplasm; tumor marker; transmembrane protein 16A; BOND_PC; 

KW transmembrane protein 16A; 

KW transmembrane protein 16A (eight membrane-spanning domains); 
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KW oral cancer overexpressed 2; membrane protein; 

KW tumor amplified and overexpressed sequence 2; 

KW transmembrane protein 16A [Homo sapiens]; TMEM16A; TA0S2; 0RA0V2; 

KW FLJ10261. 

XX 

OS Homo sapiens. 
XX 

PN US2006040292-A1 . 

XX 

PD 23-FEB-2006. 

XX 

PF 08-JUL-2005; 2005US-00177894 . 
XX 

PR 08-JUL-2004; 2004US-0586676P . 

XX 

PA (WEST/) WEST R B. 

PA (VRIJ/) VAN DE RUN M. 

XX 

PI West RE, Van De Rijn M; 
XX 

DR WPI; 2006-182760/19. 

DR N-PSDB; AEG11136. 

DR REFSEQ; NP_060513. 

DR PCiNCBI; gi40354210. 
XX 

PT Classifying tumor as gastrointestinal stromal tumor belonging to PDGFRA 

PT positive subclass, involves detecting expression or activity of gene 

PT encoding DOGl polypeptide in sample. 
XX 

PS Disclosure; SEQ ID NO 7; 177pp; English. 

XX 

CO The present invention relates to three gene markers such as DOGl, KIT and 

CO platelet derived-growth factor receptor alpha (PDGFRA) that are useful in 

CO classifying tumors. These gene markers are useful in the classification 

CO of gastrointestinal stromal tumors (GISTs) and tumors other than GISTs. 

CC The invention also relates to methods providing diagnostic, prognostic 

CC and predicative information based on the classifying step. The invention 

CC is useful for classifying gastrointestinal stromal tumors as belonging to 

CC a PDGFRA positive subclass, KIT negative or PDGFRA negative subclass. The 

CC present sequence is human transmembrane protein 16A (DOGl; TMEM16A) . The 

CC DOGl gene encodes a transmembrane protein of unknown function 

CC (transmembrane protein 16A) . The transmembrane protein 16A is encoded by 

CC DOGl gene that is mapped to llql3.2 on chromosome 11. 
CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 96 0 AA; 
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Query Match 30.1%; Score 1488; DB 11; Length 960; 

Best Local Similarity 37.6%; Pred. No. 3.5e-143; 

Matches 363; Conservative 160; Mismatches 307; Indels 136; Gaps 28; 

Qy 26 GLYCRDQAHAERWAMT— SETSSGSHCARSRMLRRRAQEEDSTVLIDVSPPEAEKRGSYG 83 

Mill : : : I I : I I I I I I : I 
Db 52 GLYFRDGRRKVDYILVYHHKRPSG NRTLVRRVQHSDTP SGA 92 

Qy 8 4 STAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRET 143 

: I : I : I I I I : I : I III 
Db 93 RSVKQDHPLPGKGASLDAGSGEPP MDYHEDD KRFRREE 13 0 

Qy 144 FLDNLRAAGLCVDQQDVQDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQAS 200 

: II III:: : I : I : I : : I I I I I I I I : I I : I : : : : 

Db 131 YEGNLLEAGLELE RDEDTKIHGVGFVKIHAPWNVLCREAEFLKLKMPTKKMYH— I 18 4 

Qy 201 NWSAGLLAWLGIPNVLLEVVPDVPPEYYSCR FRVNKLPRFLGSDNQDTFF 25 0 

I : I I I I : I I : : : I : I I I | | | : | : | | 

Db 185 NETRGLLK — KINSVLQKITDPIQPKVAEHRPQTMKRLSYPFSREKQHLFDLSD-KDSFF 2 41 

Qy 251 TSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRL 310 

I I I : : I I 1 : I I : : I I I I I I I : I I : I I I I I : : 
Db 242 DSKTRSTIVYEILKRTTCTKAKYS-MGITSLLANGVYAAAYPLHDGDY NGENVEF 295 

Qy 311 NQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLV 3 70 

I I : : I : : I I I : I : I I I I : I I I : I I I I I : I I I I I I I II I : I I : : I I : I I I 

Db 296 NDRKLLYEEWARYGVFYKYQPIDLVRKYFGEKIGLYFAWLGVYTQMLIPASIVGIIVFLY 355 

Qy 371 GCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLF 429 

II : : I I : I : I : : I I I I I I : I : I I I I I I : I III: I I I I I : I 

Db 356 GCATMDENIPSMEMCDQRHNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVF 415 

Qy 430 MALWAVLLLEYWKRKSATLAYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPI 480 

I I I I I : I : I I I I I I I I I : : I : I I I I : : I I : I 

Db 416 MALWAATFMEHWKRKQMRLNYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLKKESRNKE 4 75 

Qy 481 TGEDEPYFPERSRARRMLAGSVVIWMVAWVMCLVSIILYRAIMAIWSRSGNTLLAAW 540 

I I : II I I I : I : I I : : : I : I I M : : : : : : 

Db 476 T — DKVKLTWRDRFPAYLTNLVSIIFMIAVTFAIVLGVIIYRISMAAALAMNSSPSVRSN 533 

Qy 541 ASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 600 

: I : : I I I I : : I : : I : I I I : I : : I : I I : | | : : | | | | 

Db 53 4 IRVTVTATAVIINLVVIILLDEVYGCIARWLTKIEVPKTEKSFEERLIFKAFLLKFVNSY 5 93 

Qy 601 SSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQ 658 

: 1:111111111 I I : I : I I I I I I I I I I : I I : I : I I : I I I : I II: 
Db 594 TPIFYVAFFKGRFVGRPGDYVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLF 653 

Qy 659 EVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFV 718 
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I : I I I : I : : I : : : : I M I I II I I : I I :: I I I I I 

Db 654 EIGIPKMKKLIRYLKLKQQSPPDHEECVKRKQRYEVDYNLEPFAGLTPEYMEMIIQFGFV 713 

Qy 719 TIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVI 778 

I : I I I : I I I I I I I I I I I : II II II : II I I I II II I I : I I I I I :: I I I : I I I I 
Db 714 TLFVASFPLAPLFALLNNIIEIRLDAKKFVTELRRPVAVRAKDIGIWYNILRGIGKLAVI 773 

Qy 779 SNAFLLAFSSDFLPRA— YYRWTRAHDLRGFLNFTLARAPSSF AAAHN 824 

I I I : : : I : I I I : I I I : : : : I I : I I I III II 

Db 774 INAFVISFTSDFIPRLVYLYMYSKNGTMHGFVNHTL SSFNVSDFQNGTAPNDPLDL 82 9 

Qy 825 RTCRYRAFRD DDGHY— SQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDI 8 75 

: I I I : : I : : I I : : I : I I I I I I I I I I : : : I : : I : : I I I 

Db 83 0 GYEVQICRYKDYREPPWSENKYDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDI 889 

Qy 8 76 PESVEIKVKREYYLA KQALAENEVLFGTNGTKDEQP KG 913 

I : : : : : I I I I I I I I I I I 

Db 890 PKDISQQIHKEKVLMVELFMREEQDKQQLL— ETWMEKERQKDEPPCNHHNTKACPDSLG 947 

Qy 914 SELSSH 919 

I I I 

Db 948 SPAPSH 953 



RESULT 12 
AEG11146 

ID AEG11146 standard; protein; 840 AA. 

XX 

AC AEG11146; 
XX 

DT 15-JUN-2007 (revised) 

DT 20-APR-2006 (first entry) 

XX 

DE Human transmembrane protein 16A, SEQ ID NO: 11. 

XX 

KW Genetic marker; diagnostic; prognosis; gastrointestinal tumor; 

KW cytostatic; neoplasm; tumor marker; transmembrane protein 16A; BOND_PC; 

KW TMEM16A protein; TMEM16A protein [Homo sapiens] . 

XX 

OS Homo sapiens . 

XX 

PN US2006040292-A1 . 

XX 

PD 23-FEB-2006. 

XX 

PF 08-JUL-2005; 2005US-00177894 . 
XX 

PR 08-JUL-2004; 2004US-0586676P . 
XX 
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PA (WEST/) WEST R B. 

PA (VRIJ/) VAN DE RUN M. 

XX 

PI West RB, Van De Rijn M; 
XX 

DR WPI; 2006-182760/19. 

DR N-PSDB; AEG11141. 

DR GENBANK; AAH33036. 

DR PC:NCBI; gi34192278. 
XX 

PT Classifying tumor as gastrointestinal stromal tumor belonging to PDGFRA 

PT positive subclass, involves detecting expression or activity of gene 

PT encoding DOGl polypeptide in sample. 

XX 

PS Disclosure; SEQ ID NO 11; 177pp; English. 

XX 

CO The present invention relates to three gene markers such as DOGl, KIT and 

CO platelet derived-growth factor receptor alpha (PDGFRA) that are useful in 

CO classifying tumors. These gene markers are useful in the classification 

CO of gastrointestinal stromal tumors (GISTs) and tumors other than GISTs. 

CC The invention also relates to methods providing diagnostic, prognostic 

CO and predicative information based on the classifying step. The invention 

CC is useful for classifying gastrointestinal stromal tumors as belonging to 

CC a PDGFRA positive subclass, KIT negative or PDGFRA negative subclass. The 

CC present sequence is human transmembrane protein 16A (DOGl; TMEM16A) . The 

CC DOGl gene encodes a transmembrane protein of unknown function 

CC (transmembrane protein 16A) . The transmembrane protein 16A is encoded by 

CC DOGl gene that is mapped to llql3.2 on chromosome 11. 
CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 840 AA; 



Query Match 29.9%; Score 1479.5; DB 11; Length 840; 

Best Local Similarity 40.0%; Pred. No. 2.2e-142; 

Matches 340; Conservative 152; Mismatches 270; Indels 89; Gaps 22; 

Qy 135 DMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVH YALLSASWAVLCYYAEDLRLKLP 191 

I I II: II III:: : I : I : I : : I I I I I I I I : I I : I 

Db 6 DDKRFRREEYEGNLLEAGLELE RDEDTKIHGVGFVKIHAPWNVLCREAEFLKLKMP 61 



Qy 192 LQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCR FRVNKLPRFL 2 41 

: : : : I : I I I I : I I : : : I : I I I I 

Db 6 2 TKKMYH — INETRGLLK — KINSVLQKITDPIQPKVAEHRPQTMKRLSYPFSREKQHLFD 117 



Qy 242 GSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTP 301 

I I : I : I I I I I : : I I I : I I : : I I I I I I I : I I : I I I I I : 

Db 118 LSD-KDSFFDSKTRSTIVYEILKRTTCTKAKYS-MGITSLLANGVYAAAYPLHDGDY 172 
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Qy 302 PEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAA 361 

: I I : : I : : I I I : I : I I I I : I I I : I I I I I : I I I I I I I II I : M : 

Db 173 — NGENVEFNDRKLLYEEWARYGVFYKYQPIDLVRKYFGEKIGLYFAWLGVYTQMLIPAS 23 0 

Qy 36 2 WGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDH 42 0 

: I I : I I I I I : : I I : I : I : : I I I I I I : I : I I I I I I : I III: 
Db 231 IVGIIVFLYGCATMDENIPSMEMCDQRHNITMCPLCDKTCSYWKMSSACATARASHLFDN 290 

Qy 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAA SAPMT 475 

11111:111111 : I : I I I I I I I I I : : I : I : II : : I I 
Db 291 PATVFFSVFMALWAATFMEHWKRKQMRLNYRWDLTGFEEEEDHPRAEYEARVLEKSLKKE 35 0 

Qy 4 76 APNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLVSIILYRAIMAIVVSRSGNT 535 

: I I I : II I I I : I : I I : : : | : | | | | : : : : 

Db 351 SRNKET — DKVKLTWRDRFPAYLTNLVSIIFMIAVTFAIVLGVIIYRISMAAALAMNSSP 408 

Qy 536 LLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQ 5 95 

: : : I : : I I I I : : I : : I : I I I : I : : I : I I : II:: 

Db 409 SVRSNIRVTVTATAVIINLWIILLDEVYGCIARWLTKIEVPKTEKSFEERLIFKAFLLK 46 8 

Qy 596 FVNFYSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI 654 

III I : 1:111111111 I I : I : I I I I I I Nihil : I : I I : I I I : I 
Db 46 9 FVNSYTPIFYVAFFKGRFVGRPGDYVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLI 52 8 

Qy 655 -NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVL 713 

I I : I : I I I : I : : I : : : : I I I I I II I I : I I : : 

Db 52 9 QNNLFEIGIPKMKKLIRYLKLKQQSPPDHEECVKRKQRYEVDYNLEPFAGLTPEYMEMII 588 

Qy 714 QFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLT 7 73 

111111:111: I I I I I I I I I I I : I I I I I I : I I I I I I I I I I I : I I I I I :: I I I : 
Db 589 QFGFVTLFVASFPLAPLFALLNNIIEIRLDAKKFVTELRRPVAVRAKDIGIWYNILRGIG 6 48 

Qy 774 HLAVISNAFLLAFSSDFLPRA— YYRWTRAHDLRGFLNFTLARAPSSF AAAHN 82 4 

I I I I : I I : : : I : I I I : I I I : : : : I I : I I I III II 

Db 649 KLAVIIDAFVISFTSDFIPRLVYLYMYSKNGTMHGFVNHTL SSFNVSDFQNGTAPN 704 

Qy 825 RTCRYRAFRD DDGHY — SQTYWNLLAIRLAFVIVFEHVVFSVGRLLDL 8 70 

: I I I : : I : : I I : : I : I I I I I I I I I I : : : I : : I 
Db 705 DPLDLGYEVQICRYKDYREPPWSENKYDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDW 76 4 

Qy 871 LVPDIPESVEIKVKREYYLA KQALAENEVLFGTNGTKDEQP 911 

: : I I M : : : : : I I III I I I I I 

Db 765 VIPDIPKDISQQIHKEKVLMVELFMREEQDKQQLL— ETCMEKERQKDEPPCNHHNTKAC 822 

Qy 912 KGSELSSH 919 

II II 

Db 823 PDSLGSPAPSH 833 



http://es/ScoreAccessWeb/GetItem.action?AppId=10552...4_083145_us-10-552-515-l.rag&ItemType==4&startByte=0 (36 of 44)10/10/2008 8:48:10 AM 



SCORE Search Results Details for Application 10552515 and Search Result 20080624_083145_us-10-552-515-l.rag. 



RESULT 13 
ADG48280 

ID ADG48280 standard; protein; 1003 AA. 

XX 

AC ADG4828 0; 
XX 

DT ll-MAR-2004 (first entry) 

XX 

DE Human retina-specific protein - C12orf 3variants . 

XX 

KW human; retina-specific protein; NETOl; retinal disease; 

KW age related macular degeneration; night blindness; C12orf 3variants . 

XX 

OS Homo sapiens . 
XX 

PN WO2003068967-A2 . 
XX 

PD 21-AUG-2003. 
XX 

PF 18-FEB-2003; 2003WO-EP001625 . 
XX 

PR 18-FEB-2002; 2002EP-000036 75 . 

PR 21-FEB-2002; 2002US-0357857P . 
XX 

PA (LYNK-) LYNKEUS BIO TECH GMBH. 

XX 

PI Stoehr BH, Weber FHB, Goehring F; 
XX 

DR WPI; 2003-767334/72. 

DR N-PSDB; ADG48279. 

XX 

PT New nucleic acid encoding retinal protein sNETOl, useful for diagnosis of 

PT retinal disease, especially macular degeneration, also for drug screening 

PT and therapy. 

XX 

PS Claim 18; Fig 14; 199pp; English. 

XX 

CC The invention comprises the amino acid and coding sequences of a human 

CC retina-specific protein - NETOl. The DNA and protein sequences of the 

CC invention are useful in the treatment of retinal diseases, such as 

CC macular degeneration (especially age related) and night blindness. The 

CC present amino acid sequence represents the human retina-specific protein 

CC C12orf3variants . 

XX 

SQ Sequence 1003 AA; 

Query Match 29.6%; Score 1464; DB 7; Length 1003; 
Best Local Similarity 37.4%; Fred. No. l.le-140; 
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Matches 344; Conservative 167; Mismatches 284; Indels 124; Gaps 27; 

Qy 80 GSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRT 13 9 

I I I I I I I : : I I : : I : 
Db 129 GETGKEPHAGGPG DIELG-PLDALEEERKEQ 15 8 

Qy 140 WRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQ- 198 

III II I I I : : : I : : : : : : I I I I | | | : : | : | : : : 

Db 159 -REEFEHNLMEAGLEL-EKDLENKSQGSIFVRIHAPWQVLAREAEFLKIKVPTKKEMYEI 216 

Qy 19 9 ASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLP RFLGSDNQ 2 46 

1:111 : : I I I I : : : : | : 

Db 217 KAGGSIAKKFSAAL QKLSSHLQPRV-PEHSNNKMKNLSYPFSREKMYLYNIQEK 26 9 

Qy 247 DTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQ 306 

Mil: I : I : I I I : I | : | | : | : | : | | : j jj jj : : | : 

Db 270 DTFFDNATRSRIVHEILKRTACS-RANNTMGINSLIANNIYEAAYPLHDGEYDSPEDD — 326 

Qy 30 7 APRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTL 36 6 

: I I : : I : I I I I : I : I : I I : I : I : I I I I I : I I I I I II II : I : I : : I : I : 

Db 327 MNDRKLLYQEWARYGVFYKFQPIDLIRKYFGEKIGLYFAWLGLYTSFLIPSSVIGVI 383 

Qy 36 7 VFLVGCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVF 425 

I I I I I : I I I : : I : I : : : I I I I I I I : I I I I I I III III: III 
Db 384 VFLYGCATIEEDIPSREMCDQQNAFTMCPLCDKSCDYWNLSSACGTAQASHLFDNPATVF 443 

Qy 426 FSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEER PRPQFAA 470 

11:111111:11111 MM: I : I I I I I : : 

Db 444 FSIFMALWATMFLENWKRLQMRLGYFWDLTGIEEEEERAQEHSRPEYETKVREKMLKESN 5 03 

Qy 471 -SAPMTAPNPIT GEDEPYFPERSRARRMLAGSWIVVMVAWVMCLVS I ILYRAIM 525 

II I : I I : I I I : I : I : : : I : I I 

Db 504 QSAVQKLETNTTECGDEDDEDKLTWKDRFPGYLMNFASILFMIALTFSIVFGVIVYRITT 563 

Qy 526 AIWSRSGNTLLAAWASRI ASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTK 582 

I : I I I I : : I : : I I I I I I I : I I : : I I I : I : : I : 

Db 56 4 AAALS LNKATRSNVRVTVTATAVIINLWILILDEIYGAVAKWLTKIEVPKTEQT 618 

Qy 583 FEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELA 6 41 

II: II I : : I I I II I : I I I I I I I I I I I : I : I I I I I I I I I I I : I I 
Db 619 FEERLILKAFLLKFVNAYSPIFYVAFFKGRFVGRPGSYVYVFDGYRMEECAPGGCLMELC 6 78 

Qy 6 42 QELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGA-SQGP — WEDDYE 697 

: I : I I : I I I : I I I : I : : I I I I : I I : : I I : I I : I I : I I 

Db 679 IQLSIIMLGKQLIQNNIFEIGVPKLK KLFRKLKDETEAGETDSAHSKHPEQWDLDYS 735 

Qy 698 LVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAE 75 7 

I I II I I : I I : : I I I I I I : I I I : I I I I : I I I I I I : I : I I I I : I I I I III I 

Db 736 LEPYTGLTPEYMEMIIQFGFVTLFVASFPLAPVFALLNNVIEVRLDAKKFVTELRRPDAV 795 
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Qy 758 RAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHD— LRGFLNFTLA— 813 

I : I I I I I I I I : I : : I I I I I I : : I : I I I : I I I : : : : I : I I I : I I I : 

Db 796 RTKDIGIWFDILSGIGKFSVISNAFVIAITSDFIPRLVYQYSYSHNGTLHGFVNHTLSFF 855 

Qy 814 RAPSSFAAAHNRTCRYRAFRD DDGHYSQTYWNLLAIRLAFVIVFEH 85 9 

: : I : I I : : : I : : : | : | | : | : | | | | | | : | : : 

Db 856 NVSQLKEGTQPENSQFDQEVQFCRFKDYREPPWAPNPYEFSKQYWFILSARLAFVIIFQN 915 

Qy 86 0 WFSVGRLLDLLVPDIPESVEIKVKRE YYLAKQALAENEVLFGTNGTKDEQPKG 913 

: I : I : I : : I I I I : : : I : I : : I : I : I I : II 

Db 916 LVMFLSVLVDWMIPDIPTDISDQIKKEKSLLVDFFLKE EHEKLKLMDEPALRSPGG 971 

Qy 914 SELSSHWTPFTVPKA-SQL 931 

: I : I III 

Db 972 GDRSRSRAASSAPSGQSQL 990 



RESULT 14 
AEH82071 



AEH82071 standard; protein; 913 AA. 



AC 
XX 
DT 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 



15-JUN-2007 
13-JUL-2006 



(revised) 

(first entry) 



Human gnathodiaphyseal dysplasia protein, GDDl . 

Osteopathic; Gene therapy; bone disease; bone injury; bone resorption; 
gnathodiaphyseal dysplasia; GDDl; BOND_PC; transmembrane protein 16E; 
integral membrane protein GDDl; transmembrane protein 16E [Homo sapiens]; 
TMEM16E; GDDl; integral membrane protein GDDl [Homo sapiens]; G05783; 
GO16020; GO16021. 

Homo sapiens . 

Key Location/Qualifiers 

Inhibitory-site 356 

/note= "Missense mutations in the coding sequence can 
lead to substitution of this residue with either Arg or 
Gly" 



JP2006121961-A. 



PD 18-MAY-2006. 

XX 

PF 28-OCT-2004; 2004JP-00313511 . 
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PR 28-OCT-2004; 2004JP-00313511 . 
XX 

PA (UYTO-) UNIV TOKUSHIMA NAT UNIV CORP. 

XX 

PI Itakura M, Tsutsumi S, Kamata N, Inoue H; 

XX 

DR WPI; 2006-367194/38. 

DR N-PSDB; AEH82070. 

DR PCiNCBI; gi47106048. 

DR PC:SWISSPROT; Q75V66 . 

XX 

PT Novel gnathodiaphseal dysplasia DNA, useful as diagnostic agent for bone 

PT disease such as gnathodiaphseal dysplasia, bone deficiency or bone- 

PT resorption property disease. 
XX 

PS Claim 9; SEQ ID NO 2; llpp; Japanese. 
XX 

CC The present invention relates to a human gnathodiaphyseal dysplasia (GDD) 

CC coding sequence (GDDl; AEH82070) and encoded protein (AEH82071). GDDl is 

CC useful as a bone disease diagnostic agent, where the bone disease is GDD, 

CC bone deficiency and/or bone-resorpt ion property disease, where the GDD 

CC disease causes hardening of bone, susceptibility to fracture, cement bone 

CC pathology of a lower jaw bone, etc. GDDl is also useful in bone formation 

CC regeneration, hard tissue reconstruction, etc., and in research 

CC application. 
CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 913 AA; 

Query Match 29.4%; Score 1455; DB 11; Length 913; 

Best Local Similarity 38.6%; Pred. No. 8.4e-140; 

Matches 325; Conservative 154; Mismatches 276; Indels 86; Gaps 22; 

Qy 10 8 RIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGL CVDQQDVQDGN 16 4 

I I I I I : : I : I I : : I : I III II I : : I : I I 

Db 7 8 RQIDFVLSYVDDVKKD AELKAERRKEFETNLRKTGLELEIEDKRDSEDGR 12 7 

Qy 165 TTVHYALLSASWAVLCYYAEDLRLKLPLQE— LPNQASNWSAGLLAWLGIPNVLLEWPD 222 

I : : : I I I I I I I I : I : I : : I : I : : I : : I I I 

Db 128 T — YFVKIHAPWEVLVTYAEVLGIKMPIKESDIPRPKHTPISYVLGPVRLP — LSVKYPH 183 

Qy 223 VPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEK-KNLLGIHQL 281 

I I I : : : I : : II I I I I I I : I : : I : : I I : : I : I I I I I : I 
Db 184 — PEYFTAQFSRHRQELFLIED-QATFFPSSSRNRIVYYILSRCPFGIEDGKKRFGIERL 240 

Qy 282 LAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGE 341 

I I : I : I I I I I : III I : I I I : I I I : : | | | | | : : | : | | 
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Db 241 LNSNTYSSAYPLHDGQYWKPSEPPNP — TNERYTLHQNWARFSYFYKEQPLDLIKNYYGE 298 



Qy 342 KVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSK — DSFEMCPLCLD 399 

I : : I I : I I I I I I I I I I I I : I : : : I : I : I I I I I 

Db 299 KIGIYFVFLGFYTEMLFFAAWGLACFIYGLLSMEHNTSSTEICDPEIGGQMIMCPLCDQ 358 

Qy 400 -CPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYE 458 

I : I I : I I : : III: I I I I : : I I : I I I I : I I :: I I I I I I : I 

Db 359 VCDYWRLNSTCLASKFSHLFDNESTVFFAIFMGIWVTLFLEFWKQRQARLEYEWDLVDFE 418 

Qy 45 9 DTEE— RPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAWVMCLV 516 

: : : : I I : I I I : I I I I I I : I : : : : : I I : | 

Db 419 EEQQQLQLRPEFEAMCKHRKLNAVTKEMEPYMPLYTRIPWYFLSGATVTLWMSLWTSMV 4 78 

Qy 517 SIILYRAIMAIWSRSGNTLLAAWASRI ASLTGSVVNLVFIL 558 

: : I : I I : : I : I I : I II II : I : I I 

Db 479 AVIVYRL SVFATFASFMESDASLKQVKSFLTPQITTSLTGSCLNFIVIL 52 7 

Qy 55 9 ILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPG 618 

II: I : : : I : I : I I : : I : I I I : I : I I I I I I I I I I : I I I I I : I I I I I I 

Db 528 ILNFFYEKISAWITKMEIPRTYQEYESSLTLKMFLFQFVNFYSSCFYVAFFKGKFVGYPG 587 

Qy 619 NYHTLFGV-RNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSK 6 77 

I II 1:111 I I I I I I I : I : I I I I I : I : : I : I II : 
Db 588 KYTYLFNEWRSEECDPGGCLIELTTQLTIIMTGKQIFGNIKEAIYPLALNWW R 6 40 

Qy 678 KRKAGASAGASQGPWEDDYELVPCE— GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLN 735 

: I I I : : I I I : : I I I I I I I I I I I I I I I : I I I : I I I I I I I : I 

Db 641 RRKARTNSEKLYSRWEQDHDLESFGPLGLFYEYLETVTQFGFVTLFVASFPLAPLLALIN 700 

Qy 736 NWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAY 795 

11111:111 : I I I I I : I I I : I III: I : I : I I I : : I I : I I : I I 
Db 701 NIVEIRVDAWKLTTQYRRTVASKAHSIGVWQDILYGMAVLSVATNAFIVAFTSDIIPRLV 76 0 

Qy 796 YRW TRAHDLRGFLN FTLARAPSSFAAAHNR TCRYRAFR DDDGHY- 83 9 

I : : I : : I I : I I : I : I I I I I I : I II: I 

Db 761 YYYAYSTNATQPMTGYVNNSLSVFLIADFPNHTAPSEKRDFITCRYRDYRYPPDDENKYF 82 0 

Qy 840 -SQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENE 898 

: : I : : I I : : I : I I I I I I I I II : : I I : I : I : : I I I : : I : I 
Db 821 HNMQFWHVLAAKMTFIIVMEHWFLVKFLLAWMIPDVPKDWERIKREKLMTIKILHDFE 880 

Qy 899V899 

Db 881L881 



RESULT 15 
ABB62812 
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ID ABB62812 standard; protein; 1219 AA. 
XX 

AC ABB62812; 

XX 

DT 15-JUN-2007 (revised) 

DT 26-MAR-2002 (first entry) 

XX 

DE Drosophila melanogaster polypeptide SEQ ID NO 15228. 

XX 

KW Drosophila; developmental biology; cell signalling; insecticide; 

KW pharmaceutical; BOND_PC; CG6938-PA; CG6938-PA [Drosophila melanogaster] ; 

KW CG6938. 

XX 

OS Drosophila melanogaster. 

XX 

FN WO200171042-A2 . 
XX 

PD 27-SEP-2001. 

XX 

PF 23-MAR-2001; 2001WO-US009231 . 
XX 

PR 23-MAR-2000; 2000US-0191637P . 

PR ll-JUL-2000; 2000US-00614150 . 
XX 

PA (PEKE ) PE CORP NY. 
XX 

PI Venter JC, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2001-656860/75. 

DR N-PSDB; ABL06915. 

DR PC:NCBI; gi24663059. 

XX 

PT New isolated nucleic acid detection reagent for detecting 1000 or more 

PT genes from Drosophila and for elucidating cell signaling and cell-cell 

PT interactions. 

XX 

PS Disclosure; SEQ ID NO 15228; 21pp + Sequence Listing; English. 

XX 

CC The invention relates to an isolated nucleic acid detection reagent 

CC capable of detecting 1000 or more genes from Drosophila. The invention is 

CC useful in developmental biology and in elucidating cell signalling and 

CC cell-cell interactions in higher eukaryotes for the development of 

CC insecticides, therapeutics and pharmaceutical drugs. The invention 

CC discloses genomic DNA sequences (ABLl 6 1 76-ABL3 05 1 1 ) , expressed DNA 

CC sequences (ABL01840-ABL16175 ) and the encoded proteins (ABB57737- 

CC ABB72072) . The sequence data for this patent did not form part of the 

CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 

CC 
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CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 1219 AA; 

Query Match 29.2%; Score 1445; DB 4; Length 1219; 

Best Local Similarity 35.6%; Pred. No. 1.4e-138; 

Matches 342; Conservative 165; Mismatches 332; Indels 122; Gaps 27; 

Qy 35 AERWAMTSETSSGSHCARSRML RRRAQEEDSTVLIDVSPPEAEKRGSY 82 

I : I : I II 1:1 I : I : j : j III 
Db 249 ADRVNQSYEVMESSH SNVLPDQFGYRQLIPTERKASDTASSV SGSY 294 

Qy 83 GSTAHASEP GGQQAAACRAGSPAKP RIADFVLVW-EEDLKLDRQ 125 

: I I : II: I : I I I I I I I I I : : 

Db 295 YGSRKASKSNSLGGESGDERRVSKQDREGLDPESLMFRDGRRKVDMVLAWEEEDLGVMTE 35 4 

Qy 126 QDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQD-VQDGNTTVHYALLSASWAVLCYYAE 184 

: : I I I : I : : I I I I I :: I I I : : I : II 

Db 355 AEAKRRDN RRSFMENLIKEGLEVELEDKSQSFNEKTFFLKIHLPWRLETRLAE 40 7 

Qy 185 DLRLKLP LQELPNQASNWSAGLLAWLGIPNVLLEWPDVPP 225 

: I I I I I : : j | : : | | | 
Db 408 VMNLKLPVKRFITISVKPSWDEENWLRNMQYWKDVWQR-LTKKIQLDQTLLE GET 462 

Qy 226 EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEG 285 

: : I : I : I : I I I I : I : : : : I : I I : : : I I : I : : I 

Db 463 TFKAATANGNPEEQFIVKD-RATAFTSAQRSLMVMQVLIRTPFDESDRS— GIRRLMNDG 519 

Qy 286 VLSAAFPLHDGPFKTPPEGPQAPRLN-QRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVA 3 44 

I I I I : I : : I : : : I : I I : I I I : I I I I I I I : I I I : I : I 

Db 520 TYLGCFPLHEGRY DRPHSSGISLDRRVLYQTWAHPSQWYKKQPLCLVRKYFGDKIA 575 

Qy 345 LYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSD— IPTQELCG— SKDSFEMCPLC-LD 399 

I I I I I I I I I I : 1111111:1 : I : I : : I : I : : I I I I 

Db 5 76 LYFCWLGFYTEMLVYPAWGTLCFIYGLATLESEDNTPSKEICNEYGTGNITLCPLCDKA 635 

Qy 400 CPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYED 459 

I : I I : I : : III: I I I I : : I I : I I I I I I I I : I : I I : I 
Db 636 CSYQRLSESCLFSRLTYLFDNPSTVFFAIFMSFWATTFLELWKRKQSVLVWEWDLHNV-D 6 94 

Qy 46 0 TEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAWVMCLVSII 519 

: I I I : I : I I I : I I I I I : I : I : : : I : : || : : : I 

Db 695 MDEENRPEFETNATTFRMNPVTREKEPYMSTWNRSIRFVITGSAVLFMISWLSAVLGTI 754 

Qy 520 LYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRT 5 79 

III : I : I : I I : I : : : I I I I : I I : : I I : I I I I I I 

Db 755 LYRITLVSVIYGGGGFFVKEHAKLFTSVTAALINLWIMILTRIYHRMAIKLTNLENPRT 814 
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Qy 58 0 QTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHT LFGVRNEECAAGG 635 

I : : I I : : I I : I I : I : I I I I I : I I I I I I I I I III: I : : I : I I 

Db 815 HTEYEDSYTFKIFFFEFMNFYSSLIYIAFFKGRFFDYPGDDQARKSEFFRLKNDICDPAG 8 74 

Qy 636 CLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDD 6 95 

II I I : I : I I I I I I II I I I I II: : I : : I III 

Db 875 CLSELCIQLAIIMVGKQCWNNFMEYLFPKFWNWWR QRKHKQATKDESHLHMAWEQD 930 

Qy 6 96 YELV-PCE-GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRR 753 

I : I I I I I I I I I : I I : I I II : I I I I I I I I I I I I I I I I I I I I I I I : I I 

Db 931 YHMQDPGRLALFDEYLEMILQYGFVTLFVAAFPLAPLFALLNNVAEIRLDAYKMVTQARR 990 

Qy 754 PVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR — WTRAHDLRGFLNFT 811 

1:111:1111:11 : I : I I : I I I I : : I : : I I I : I I I : : : III:: : 
Db 991 PLAERVEDIGAWYGILRIITYTAWSNAFVIAYTSDFIPRMVYKFVYSETHTLAGYIEHS 1050 

Qy 812 LA RAPSSFAAAHNRTCRYRAFRDDDGHY SQTYWNLLAIRLAFVIVF 85 7 

I : : I : | | : | | : | : I I | | : : | | | | | | : | | 

Db 1051 LSIFNTSDYKEEWGASVSEKDPDTCQYRGYRNGPKDYEPYGLSPHYWHVFAARLAFWVF 1110 

Qy 85 8 EHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELS 917 

Mill: : : : : I I : I I : : : : I I I I I : I : : I I I : : 

Db 1111 EHVVFVITGIMQFIIPDVPSEVKTQMQREQLLAKEAKYQ HGIKRAQGDSQDIM 1163 

Qy 918 S 918 

I 

Db 1164 S 1164 



Search completed: June 24, 2008, 08:45:38 
Job time : 270 sees 
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